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NITRILASE HOMOLOGS 



5 FIELD OF THE INVENTION 



The present invention generally relates to the field of oncology and tumor 
suppressor genes, and more particularly to the structure and function of the NITl 
gene, the structure of its encoded proteins, and the use of NITl genes and the NITl 
10 related genes and their encoded proteins and vectors containing the NITl coding 
sequence as diagnostic and therapeutic reagents for the detection and treatment of 
cancer. 



15 BACKGROUND OF THE INVENTION 



Introduction 



The present invention relates to nucleotide sequences of the NITl gene and 
20 amino acid sequences of its encoded proteins, as well as derivatives and analogs 
thereof. Additionally, the present invention relates to the use of nucleotide 
sequences of NITl genes and amino acid sequences of their encoded proteins and 
vectors containing the NITl coding sequence, as well as derivatives and analogs 
thereof and antibodies thereto, as diagnostic and therapeutic reagents for the 
25 detection and treatment of cancer. The present invention also relates to therapeutic 
compositions comprising Nitl proteins, derivatives or analogs thereof, antibodies 
thereto, nucleic acids encoding the Nitl proteins, derivatives, or analogs, and NITl 
antisense nucleic acids, and vectors containing the NITl coding sequence. 



30 



1 
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Approaches to Elucidation and Characterization of NIT 1 

The tumor suppressor gene FHIT encompasses the common human 
5 chromosomal fragile site at 3pl4.2 and numerous cancer cell bi-allelic deletions. 
To study Fhit function, Fhit genes in Z). melanogaster and C elegans were cloned 
and characterized. The Fhit genes in both of these organisms code for fusion 
proteins in which the Fhit domain is fused with a novel domain showing homology 
to bacterial and plant nitrilases; the D, melanogaster fusion protein exhibited 
10 diadenosine triphosphate (ApppA) hydrolase activity expected of an authentic Fhit 
homolog. 

In human and mouse, the nitrilase homologs and Fhit are encoded by two 
different genes, FHIT and NITl^ localized on chromosomes 3 and 1 in human, and 
14 and 1 in mouse, respectively. Human and murine NITl genes were cloned and 

15 characterized, their exon-intron structure, their patterns of expression, and their 
alternative mRNA processing were determined. 

The tissue specificity of expression of murine FHIT and NITl genes was 
nearly identical. Typically, fusion proteins with dual or triple enzymatic activities 
have been found to carry out specific steps in a given biochemical or biosynthetic 

20 pathway; Fhit and Nitl, as fusion proteins with dual or triple enzjnnatic activities, 
likewise collaborate in a biochemical or cellular pathway in mammalian cells. 

Importance of FHIT 

25 The human FHIT gene at chromosome 3pl4.2, spanning the constitutive 

chromosomal fragile site FRA3B, is often altered in the most common forms of 
human cancer and is a tumor suppressor gene. The human FHIT gene is greater 
than one megabase in size encoding an mRNA of 1 . 1 kilobases and a protein of 
147 amino acids. 

30 The rearrangements most commonly seen are deletions within the gene. 

These deletions, often occurring independently in both alleles and resulting in 
inactivation, have been reported in tumor-derived cell lines and primary tumors of 
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lung, head and neck, stomach, colon, and other organs. In cell lines derived from 
several tumor types, DNA rearrangements in the FHIT locus correlated with RNA 
and/or Fhit protein alterations. 
5 Because the inactivation of the FHIT gene by point mutations has not been 

demonstrated conclusively and because several reports have shown the 
amplification of aberrant-sized FHIT reverse transcription-PCR (RT-PCR) 
products from normal cell RNA, a number of investigators have suggested that the 
FHIT gene may not be a tumor suppressor gene. On the other hand it has been 
10 reported that re-expression of Fhit in lung, stomach and kidney tumor cell lines 
lacking endogenous protein suppressed tumorigenicity in vivo in 4 out of 4 cancer 
cell lines. This suggests that FHIT is indeed a tumor suppressor gene. It is noted 
that a report has suggested that Fhit enzymatic activity is not required for its tumor 
suppressor function. 

15 Fhit protein is a member of the histidine triad (HIT) superfamily of 

nucleotide binding proteins and is similar to the Schizosaccharomyces pombe 
diadenosine tetraphosphate (AP4A) hydrolase. Additionally it has been reported 
that, in vitro, Fhit has diadenosine triphosphate (ApppA) hydrolase enzymatic 
activity. 

20 Neither the in vivo fimction of Fhit nor the mechanism of its tumor 

suppressor activity is known. Nonetheless, genetic, biochemical and 
crystallographic analysis suggest that the enzyme-substrate complex is the active 
form that signals for tumor suppression. One approach to investigate function is to 
investigate Fhit in model organisms such as Drosophila melanogaster and 

25 Caenorhabditis elegans. 

The present invention involves the isolation and characterization of the 
NITl gene in these organisms. Fhit occurs in a fusion protein, Nit-Fhit, in D. 
melanogaster and C. elegans, but FHIT and NITl are separate genes in 
mammalian cells. The human and mouse NITl genes are members of an 

30 uncharacterized mammalian gene family with homology to bacterial and plant 
nitrilases, enzymes which cleave nitriles and organic amides to the corresponding 
carboxylic acids plus ammonia. 
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SUMMARY OF THE INVENTION 

Accordingly, it is an object of the present invention to purify a NITl gene. 
5 It is a further object of the present invention to purify a NITl gene, wherein 

the purified gene is a human gene. 

It is an object of the present invention to purify a NITl gene, wherein the 
purified gene is a mammalian gene. 

It is an object of the present invention to purify a Nitl protein. 
10 It is another object of the present invention to purify a Nitl protein, 

wherein the purified protein is a human protein. 

It is another object of the present invention to purify a Nitl protein, 
wherein the purified protein is a mammalian protein. 

Yet another aspect of the present invention is a purified protein encoded by 
15 a nucleic acid having a nucleotide sequence consisting of the coding region of 
SEQ IDNO:l (Figure 6). 

Another aspect of the present invention is an antibody capable of binding a 
Nitl protein. 

It is another object of the present invention to isolate a nucleic acid of less 
20 than 100 kb, comprising a nucleotide sequence encoding a Nitl protein. 

Another object of the present invention is a pharmaceutical composition 
comprising a therapeutically effective amount of a Nitl protein; and a 
therapeutically acceptable carrier. 

Another object of the present invention is a method of treating or 
25 preventing a disease or disorder in a subject comprising administering to said 
subject a therapeutically effective amount of a molecule that inhibits Nitl function. 

Another aspect of the present invention is a method of treating or 
preventing a disease or disorder in a subject comprising administering to said 
subject a therapeutically effective amount of a molecule that enhances Nitl 
30 function. 

It is yet another aspect of the present invention to diagnose or screen for the 
presence of or a disposition for developing a disease in a subject, comprising 
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detecting one or more mutations in NITl DNA, RNA or Nitl protein derived from 
the subject in which the presence of said one or more mutations indicates the 
presence of the disease or disorder or a predisposition for developing the disease or 
5 disorder. 

It is yet another aspect of the present invention to treat a disease or disorder 
with a vector containing the coding segment of the NITl gene. 



10 BRIEF DESCRIPTION OF THE DRAWINGS 

Fig, 1. A sequence comparison of human, murine, D. melanogaster, and C 
elegans Nitl and Fhit proteins. Identities are shown in black boxes, similarities 
are shown in shaded boxes. For human and mouse FHIT GenBank accession 
1 5 numbers are U46922 and AF047699, respectively. 

Fig, 2. Northem blot analysis of expression of NITl and Fi//r mRNAs in 
murine and human tissues, as well as in D. melanogaster, and C, elegans, (A) 
Mouse multiple tissues Northem blot. Lanes 1-8: heart, brain, spleen, lung, liver, 
skeletal muscle, kidney, and testis. (Top) Fhit probe; (Middle) Nitl probe; 
20 (Bottom) actin probe. (B) Human blot, NITl probe. Lanes 1-8: heart, brain, 
placenta, lung, liver, skeletal muscle, kidney, and pancreas. (C) Lanes 1 and 2: D. 
melanogaster adult, D. melanogaster embryo; D. melanogaster Nit-Fhit probe. 
Lane 3: C. elegans adult; C. elegans Nit-Fhit ipvohc. 

Fig. 3. Genomic organization of human and murine NITl genes and D. 
25 melanogaster and C, elegans Nit-Fhit genes. (A) Exon-intron structure of the 
genes. (B) Alternative processing of human MTV gene. 

Fig. 4. Cleavage of ApppA by Z). melanogaster Nit-Fhit, At indicated 
times of incubation, samples were spotted on TLC plates with appropriate 
nucleotide standards. 

30 Fig. 5, Analysis of alternative transcripts of human NITl by RT-PCR. RT- 

PCR of HeLa RNA was performed with primers in different exons. Lanes 1-6: 
exons 1 and 3 (transcript 2); exons IC and 3 (transcript 5); exons 1 A and 3 



CRO01.NP001 



PATENT 



(transcripts 3, upper band and 4, lower band): exons 2 and 3 (transcripts 2-4); 
exons 1 and IC (transcript 5); and exons 1 and 2 (transcript 2). 

Fig. 6. Highly conserved sequence of human, murine, D. melanogaster, 
5 and C. elegans NITl gene. (SEQ ID NO: 1), 

DETAILED DESCRIPTION 

10 Genomic and cDNA clones 

One million plaques of a mouse genomic library (bacteriophage library 
from strain SVJ129, Stratagene, La JoUa, CA) and one hundred thousand plaques 
of a D. melanogaster genomic library were screened with corresponding cDNA 

15 probes. Clones were purified and DNA was isolated. Sequencing was carried out 
using Perkin Elmer thermal cyclers and ABI 377 automated DNA sequencers. 
DNA pools from a human BAC library (Research Genetics, Huntsville, AL) were 
screened by PCR with NITl primers (TCTGAAACTGCAGTCTGACCTCA (SEQ 
ID N0:2) and CAGGCACAGCTCCCCTCACTT (SEQ ID NO:3)) according to 

20 the supplier's protocol. The DNA from the positive clone, 31K11, has been 
isolated using standard procedures and sequenced. Chromosomal localization of 
the human NITl gene was determined using a radiation hybrid mapping panel 
(Research Genetics) according to the supplier's protocol and with the same 
primers as above. To map murine Nitl gene, Southern blot analysis of genomic 

25 DNA from progeny of a (AEJ/Gn-a bp^/a bp^ x M spretus)Fl x AEJ/Gn-a bp^/ a 
bp^ backcross was performed using a full length murine Nitl cDNA probe. This 
probe detected a unique 2.0 kb Dral fragment in AEJ DNA and a unique 0.75 kb 
fragment in M spretus DNA. Segregation of these fragments were followed in 
180 N2 offspring of the backcross. Additional Mit markers {DlMit34, DlMitSS, 

30 and DlMit209) were typed from DNA of 92 mice by using PCR consisting of an 
initial denaturation of 4 minutes at 94°C followed by 40 cycles of 94°C for 30 
seconds, SS'^C for 30 seconds and 72°C for 30 seconds. Linkage analysis was 

6 
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performed using the computer program SPRETUS MADNESS: PART DEUX, 
Human and mousQNITl expressed sequence tag (EST) clones were purchased 
form Research Genetics. The sequences of human and murine NITl genes and 
5 cDNAs and D. melanogaster and C. elegans Nit-Fhit cDNAs have been deposited 
in GenBank. 

In situ hybridization 

10 D, melanogaster polytene chromosome spreads were prepared from 

salivary glands of third-instar larvae as described. NitFhit DNA fragments were 
labeled with digoxigenin-ll-dUTP using a random-primed DNA labeling kit 
(Boeringer Mannheim, Indianapolis, IN), and were used as probes for the 
chromosomal in situ hybridization. Hybridization was for 20 hours at 37°C in 

15 hybridization buffer: 50% formamide, 2x standard saline citrate (SSC), 10% 
dextran sulfate, 400 mg/ml salmon sperm DNA, Antidigoxigenin-fluorescein 
antibodies (Boehringer Mannheim) were used for detection of hybridizing regions, 
DNA was counterstained with Hoechst 33258 (Sigma, St. Louis, MO). The sUdes 
were analyzed by fluorescence microscopy. For in situ hybridization, embryos 

20 were fixed and processed as described previously, except that single-stranded 
RNA probes were used. Full length NitFhit cDNA was cloned into Bluescriptll 
KS+ vector and used to synthesize antisense RNA probes with the Genius 4 kit 
(Boehringer Mannheim). 

25 RT-PCR, Northern and RACE analysis 

Human and mouse multiple tissue northem blots (Clontech, Palo Alto, CA) 
were hybridized with corresponding NITl cDNA probes and washed using the 
supplier's protocol. For the HeLa cell line, total RNA was isolated from 1-5 x 10^ 
30 cells using Trizol reagent (Gibco BRL, Gaithersburg, MD). D. melanogaster 
PolyA+ RNA was purchased from Clontech. Three |iig of polyA+ RNA or 15 |ag 
of total RNA were electrophoresed in 0.8% agarose in a borate buffer containing 

7 
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formaldehyde, transferred to HybondN+ membrane (Amersham, Arlington 
Heights, IL) using standard procedures and hybridized as described above. For 
RT-PCR, 200 ng of polyA+ RNA or 3 \xg of total RNA were treated with DNasel 
5 (amplification grade, Gibco BRL) following the manufacturer's protocol. DNase- 
treated RNA was used in reverse transcription (RT) reactions as follows: 10 nM 
each dNTP, 100 pmoles random hexamers (ohgo (dX) priming was used in some 
cases), DNasel treated RNA, and 200 units of murine leukemia virus (MuLV) 
reverse transcriptase (Gibco BRL), in total volume of 20 |il were incubated at 

10 42°C for 1 hour followed by the addition of 10 i^g RNase A and incubation at 
37°C for 30 min. One [il of the reaction was used for each PGR reaction. PGR 
reactions were carried out under standard conditions using 10 pmoles of each 
gene-specific primer and 25-35 cycles of 95° 30", 55-60° 30", 72° 1'. Products 
were separated on 1.5% agarose gels and sometimes isolated and sequenced or 

15 cloned and sequenced. Oligo (dT)-primed double-stranded cDNA was synthesized 
by using procedures and reagents from the Marathon RAGE cDNA amplification 
kit (Glontech); the cDNA was hgated to Marathon adapters (Glontech). 3' and 5' 
RAGE products were generated by long PGR using gene-specific primers and the 
API primer (Glontech). To increase the specificity of the procedure, the second 

20 PGR reaction was carried out by using nested gene-specific primers and the AP2 
primer (Glontech). PGR reactions were performed according to the Marathon 
protocol using the Expand long template PGR system (Boehringer Mannheim) and 
30 cycles of: 94° 30", 60° 30", 68° 4'. RAGE products were electrophoresed, 
identified by hybridization and sequenced. Degenerate FHIT primers were: 

25 GTNGTNGGNGGNGAYGTNGT (SEQ ID NO:4) and 

AGRTGNAGRTGYTTNAGNGTYTGNGG (SEQ ID NO:5). D. Melanogaster 
Fhit RAGE and RT-PGR primers were: GGGGGTTTGTGGGGTGGAGTG (SEQ 
ID NO:6) and GGGTGGGGGAAGTTGTGTGGT (SEQ ID NO:7). C. elegans 
Fhit RAGE and RT-PGR primers were: GTGGGGGGTGGTGAAAGTGG (SEQ 

30 ID NO:8) and TGGGGAGGATGAACAAGTGGG (SEQ ID NO:9). Human NITl 
RT-PGR primers were: GGGGTGGGGATGGGAGGGT (SEQ ID NO: 10) (exon 
1); GAGGTAGTGGGTATGGGGTG (SEQ ID N0:1 1) (exon la); 
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GCTGCGAAGTGCACAGCTAAG (SEQ ID NO: 12) and 

AAACTGAAGCCTCTTTCCTCTGAC (SEQ ID NO: 13) (exon Ic); 

TGGGCTTCATCACCAGGCCT (SEQ ID NO: 14) and 

5 CTGGGCTGAGCACAAAGTACTG (SEQ ID NO: 15) (exon 2); 
GCTTGTCTGGCGTCGATGTTA (SEQ ID NO: 16) (exon 3). 

Protein expression and enzymatic characterization 

10 The NIT-FHIT cDNA was amplified with primers 

TGACGTCGACATATGTCAACTCTAGTTAATACCACG (SEQ ID NO: 17) and 
TGGGTACCTCGACTAGCTTATGTCC (SEQ ID NO: 18), digested with Ndel 
and Kpnl, and cloned into plasmid pSGA02 as a Ndel-Kpnl fragment. 
Escherichia coli strain SGI 00 transformants were grown in Luria-Bertani with 

15 100 |ig/ml of ampicillin and 15 jig/ml of chloramphenicol at 15°C. When the 
culture reached an optical density (600 nm) of 0.25, isopropyl 6-D-thiogalactoside 
was added to a final concentration of 200 ^M. NitFhit protein was purified firom 
inclusion bodies as described. Briefly, the cell pellet from a 1 -liter culture was 
resuspended in 50 ml of 20 mM Tris«HCl (pH 7.5), 20% sucrose, ImM EDTA and 

20 repelleted. Outer cell walls were lysed by resuspension in ice- water. Spheroblasts 
were pelleted, resuspended in 140 mM NaCl, 2.7 mM KCl, 12 mM Na»P04 (pH 
7.3), 5mM EDTA, 500mM phenylmethylsulfonyl fluoride, 1 |ag/ml leupeptin and 
20 ng/ml of aprotinin, and sonicated. The resulting inclusion body preparation 
was washed and solubilized in 5 M guanidinium hydrochloride, 50mM Tris'HCl 

25 (pH 8.0), 5mM EDTA. Soluble NitFhit protein was added dropwise to 250ml of 
50mM Tris'HCl (pH 8.0), ImM DTT, 20% glycerol at 40°C. After a 14 hour 
incubation, the 13 -kg supernatant was concentrated 100-fold with a Centricon 
filter. A 1 -liter culture yielded approximately 200 |ig of partially purified, soluble 
NitFhit. ApppA hydrolase activity was assayed at 30°C in 20 yl of 50mM 

30 Na-HEPES pH 7.5, 10% glycerol, 0.5 mM MnC12, 4mM ApppA, 1 ^M NitFhit. 
TLC plates were developed as described. 



9 
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Cloning and characterization of D. melanogaster and C. elemns 

Fhit homoloss 

5 To obtain D. melanogaster Fhit sequences, degenerate primers were 

designed in the conserved regions of exons 5 and 7 of human FHIT. RT-PCR 
experiments with these primers and D. melanogaster RNA resulted in an -200 bp 
product, which when translated showed -50% identity to human Fhit protein. This 
sequence was used to design specific D. melanogaster Fhit primers, 5' and 3' 

10 RACE with these primers resulted in -1.5 kb full length cDNA (including 
polyadenylation signal and Poly(A) tail) encoding a 460 amino acid protein with a 
145 amino acid C-terminal part homologous to human Fhit (40% identity and 47% 
similarity) and a 315 amino acid N-terminal extension (Fig. 1). Northern analysis 
(Fig. 2C) showed a singer band of --1.5 kb in both embryo and adult D, 

1 5 melanogaster confirming that the full length cDNA has been cloned. 

The 460 amino acid predicted protein sequence was used in a BLASTP 
search. Of the top 50 scoring ahgnments, 22 aligned with the 145 residue C- 
terminal segment (Fhit-related sequences) and 28 aligned with the 315 residue N- 
terminal segment. The 28 sequences aligning with the N-terminus were led by an 

20 uncharacterized gene from chromosome X of Saccharomyces cerevisiae (P-value 
of 1.4 X 10"^^), followed by uncharacterized ORFs of many bacterial genomes and 
a series of enzymes from plants and bacteria that have been characterized as 
nitrilases and amidases. Thus, the 460 amino acid predicted protein contains an N- 
terminal nitrilase domain and a C-terminal Fhit domain and was designated 

25 NitFhit. 

The D, melanogaster Nit-Fhit cDNA probe was used to screen a D. 
melanogaster lambda genomic library. Sequencing of positive clones revealed 
that the gene is intronless and, interestingly, the 1.5-kb Nit-Fhit gene is localized 
within the 1.6-kb intron 1 of the D. melanogaster homolog of the murine glycerol 
30 kinase (Gyk) gene. The direction of transcription of the Nit-Fhit gene is opposite 
to that of the Gyk gene (Fig. 3A). It is not knovm if such localization affects 
transcriptional regulation of these two genes. 

10 
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The cytological position of the Nit-Fhit gene was determined by in situ 
hybridization to sahvary gland polytene chromosomes. These experiments 
showed that there is only one copy of the sequence which was localized to region 
5 61 A, at the tip of the left arm of chromosome 3. Digoxigenin-labeled RNA probes 
were hybridized to whole-mount embryos to determine the pattem of expression 
during development. Nit-Fhit RNA was uniformly expressed throughout the 
embryo suggesting that NitFhit protein could be important for most of the 
embryonic cells. 

10 Because human Fhit protein and the D, melanogaster Fhit domain were 

only 40% identical, to show that the authentic D. melanogaster Fhit homolog was 
cloned, its enzymatic activity was tested. Fig. 4 shows that recombinant Z). 
melanogaster NitFhit is capable of cleaving ApppA to AMP and ADP and 
therefore possesses ApppA hydrolase activity. 

15 

C. elegans 

Fhit genomic sequences were obtained from the Sanger database (contig 
Y56A3) by using BLAST searches. 5' and 3' RACE with C. elegans Fhit specific 

20 primers yielded a 1.4-kb cDNA (including polyadenylation signal and Poly(A) 
tail) coding for a 440 amino acid protein (Fig, 1). Northern analysis (Fig. 2C) 
showed a single band of a similar size in adult worms. Similarly to D. 
melanogaster, the C. elegans protein contained an N-terminal nitrilase domain and 
a C-terminal Fhit domain (Fig. 1) with 50% identity and 57% similarity to human 

25 Fhit. Comparison between C. elegans Nit-Fhit cDNA and genomic sequences 
from the Sanger database revealed that the C. elegans Nit-Fhit gene comprises 8 
exons and is more than 6.5 kb in size (Fig. 3A); the nitrilase domain is encoded by 
exons 1-6, and the Fhit domain is encoded by exons 6-8. Z). melanogaster and C 
elegans NitFhit proteins are 50% identical and 59% similar and exhibit several 

30 conserved domains (Fig. 1). 
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Clonins and characterized of human and murine NIT cDNAs and series 

Because Fhit and nitrilase domains are part of the same polypeptides in D. 
5 melanogaster and C elegans, it is reasonable to suggest that they may be involved 
in the same biochemical or cellular pathway(s) in these organisms. Because 
nitrilase homologs are conserved in animals, the mammalian nitrilase homologs 
were cloned as candidate Fhit-interacting proteins. 

To obtain human and murine NITl sequences, the D, melanogaster nitrilase 

10 domain sequence was used in BLAST searches of the GenBank EST database. 
Numerous partially sequenced human and murine NITl ESTs were found. All 
mouse Nitl ESTs were identical, as were all human NITl ESTs, suggesting the 
presence of a single NITl gene in mouse and human. To obtain the full-length 
human and mouse cDNAs, several human and mouse ESTs and human 5' and 3' 

15 RACE products were completely sequenced. This resulted in the isolation of a 
-1.4-kb full-length human sequence encoding 327 amino acids and a -1.4-kb 
mouse full-length sequence coding for 323 amino acids (Fig. 1), although several 
altematively spliced products were detected in both cases (see below and Fig. 3B). 
Both cDNAs are polyadenylated, but lack polyadenylation signals, although AT- 

20 rich regions are present at the very 3' end of each cDNA, Mouse and human Nitl 
amino acid sequences were 90% identical; the human Nitl amino acid sequence 
was 58% similar and 50% identical to the C elegans nitrilase domain and 63% 
similar and 53% identical to the D. melanogaster nitrilase domain (Fig. 1). 

Murine lambda and human BAC genomic libraries were screened with the 

25 corresponding NITl cDNA probes, yielding one mouse lambda clone and one 
human BAC clone containing the NITl genes. The human and murine NITl 
genomic regions were sequenced and compared to the corresponding cDNA 
sequences. The genomic structure of human and mouse NITl genes is shown in 
Fig. 3 A. Both genes are small: the hximan gene is -3.2 kb in size and contains 7 

30 exons; the murine gene is -3.6 kb in size and contains 8 exons. Southern analysis 
confirmed that both hxraian and mouse genomes harbor a single NITl gene. 
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A radiation hybrid mapping panel (GeneBridge 4) was used to determine 
the chromosomal localization of the human NITl gene. By analysis of PCR data at 
the Whitehead/MIT database (http://www-genome.wi,rait.edu), the NITl gene was 
5 localized 6.94 cR from the marker CHLC.GATA43A04, which is located at lq21- 
lq22. 

A full length murine Nitl cDNA probe was used to determine the 
chromosomal location of the murine gene by linkage analysis. Interspecific 
backcross analysis of 180 N2 mice demonstrated that the Nitl locus cosegregated 

1 0 with several previously mapped loci on distal mouse chromosome 1 . The region 
to which Nitl maps was further defined by PCR of genomic DNA from 92 N2 mice 
using the markers DlMit34, DlMit35 and DlMit209 (Research Genetics). The 
following order of the genes typed in the cross and the ratio of recombinants to N2 
mice was obtained: centromere - DlMit34 - 7/78 - DlMit35 - 8/90 - Nitl - 1 1/91- 

15 DlMit209 - telomere. The genetic distances given in centiMorgans (±S.E,) are as 
follows: centromere - DlMit209 - 9.0 ± 3.2 - DlMit35 - 8.9 ± 3.0 - Nitl - 12.1 
±3.4 - DlMit209 - telomere. This region of mouse chromosome 1 (lq21 - lq23) 
is syntenic to human chromosome Iq and is consistent with the localization of the 
human ortholog of NitL 

20 

Expression and alternative svlicins of human and murine Nitl senes 

For the human gene, Northern analysis revealed two major transcripts of 
-1.4 kb and -2.4 kb in all adult tissues and tumor cell lines tested. A third band of 

25 -1.2 kb was observed in adult muscle and heart (Fig. 2B). The longest cDNA 
(-1.4 kb) corresponds to the -1.4-kb transcript observed on Northem blots. The 
1.2-kb band corresponds to transcript 1 on Fig. 3B (see below). It is not known if 
the -2.4-kb RNA represents an additional transcript or an incompletely processed 
mRNA. No significant variation in human NITl mRNA levels was observed in 

30 different tissues (Fig. 2B), On the contrary, different mouse tissues showed 
different levels of expression of A^zY7 mRNA (Fig. 2 A). The highest levels of Nitl 
mRNA were observed in mouse liver and kidney (Fig. 2 A, Middle, lanes 5 and 7). 
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Interestingly, the pattern of Nitl expression was almost identical to the pattern of 
the expression of Fhit (Fig. 2A, Top and Middle), supporting the hypothesis that 
the proteins may act in concert or participate in the same pathway. 
5 Analysis of mouse Nitl ESTs revealed that some transcripts lack exon 2 

and encode a 323 amino acid protein. An altemative transcript containing exon 2 
encodes a shorter, 290 amino acid protein starting with the methionine 34 (Fig. 1). 

Analysis of human ESTs and 5' RACE products from HeLa and testis also 
suggested altemative processing. To investigate this, a series of RT-PCR 

10 experiments was carried out. Fig. 5 shows the results obtained from HeLa RNA 
(similar results were obtained using RNAs from the MDA-MB-436 breast cancer 
cell line and adult liver). The alternatively spliced transcripts are shown on Fig. 
3B. Transcript 1, lacking exon 2, was represented by several ESTs in the Genbank 
EST database. This transcript probably corresponds to the -1.2-kb transcript 

15 observed on Northern blots in adult muscle and heart. Transcript 2 encoding the 
327 amino acid Nitl protein (Fig. 1) is a major transcript of human NITl at least in 
the cell lines tested. This transcript lacks exons la and lb. Transcript 3 has exon 
la and lb; transcript 4 has exon la but lacks exon lb (Fig. 3B). It is not known if 
transcript 5 (lacking exon 2) starts from exon 1 or Ic. 

20 The altemative initiating methionines of different transcripts are shown on 

Fig. 3B. Data suggest that at least in COS-7 cells transfected with a construct 
containing transcript 2, the methionine in exon 3 (shown in transcripts 1 and 3, 
Fig. 3B) initiates more efficiently than the methionine in exon 2 (Fig. 3B, 
transcript 2). 

25 

Discussion 

Although the frequent loss of Fhit expression in several common human 
cancers is well documented, and results supporting its tumor suppressor activity 
30 have been reported, the role of Fhit in normal and tumor cell biology and its 
mechanism of its action in vivo are unknown. The ApsA hydrolytic activity of 
Fhit seems not to be required for its tumor suppressor fimction, and it has been 
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suggested that the enzyme- subtract complex is the active form of Fhit. To 
facilitate an investigation of Fhit function, a model organisms approach was 
initiated by cloning and characterization of D. melanogaster and C. elegans Fhit 
5 genes. 

Sxjrprisingly, in flies and worms, Fhit is expressed as a fusion protein with 
the Fhit domain fused into a 'TS[it" domain showing homology to plant and 
bacterial nitrilases. Human and murine NITl genes were further isolated. Nit and 
Fhit are expressed as separate proteins in mammals but, at the mRNA level, are 

1 0 coordinately expressed in mouse tissues. 

In several eukaryotic biosynthetic pathways multiple steps are catalyzed by 
multifunctional proteins containing two or more enzymatic domains. The same 
steps in prokaryotes frequently are carried out by monoenzymatic proteins that are 
homologs of each domain of the corresponding eukaryotic protein. For example, 

15 Gars, Gart and Airs are domains of the same protein in D. melanogaster and 
mammals. These domains catalyze different steps in de novo synthesis of purines. 
In yeast, Gart homolog (Ade8) is a separate protein and Gars and Airs homologs 
(Ade5 and Ade7) are domains of a bienzymatic protein; in bacteria, all three 
homologs (PurM, PurN and PurD) are separate proteins. De novo pyrimidine 

20 biosynthesis illustrates a similar case. Recently, a fusion protein of a lipoxygenase 
and catalase, both participating in the metabolism of fatty acids, has been 
identified in corals. In all of these examples, if domains of a multienzymatic 
protein in some organisms are expressed as individual proteins in other organisms, 
the individual proteins participate in the same pathways. This observation and the 

25 fact that Fhit and Nitl exhibit almost identical expression pattems in murine 
tissues suggest that Fhit and Nitl participate in the same cellular pathway in 
mammalian cells. 
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WHAT IS CLAIMED IS: 

1 . A purified NITl gene . 

5 

2. The gene of claim 1 which is a human gene. 

3. The gene of claim 1 which is a mammalian gene. 
10 4. A purified Nitl protein. 

5. The protein of claim 4 which is a human protein. 

6. A purified protein encoded by a nucleic acid having a 
1 5 nucleotide sequence consisting of the coding region of SEQ ID NO: 1 . 

7. An antibody which is capable of binding a Nitl protein. 

8. The antibody of claim 7 which is monoclonal. 

20 

9. A molecule comprising a fragment of the antibody of claim 
7, which fi-agment is capable of binding a Nitl protein. 

10. An isolated nucleic acid of less than 100 kb, comprising a 
25 nucleotide sequence encoding a Nitl protein. 

11. The nucleic acid of claim 10 in which the Nitl protein is a 
human Nitlprotein. 

30 12. A pharmaceutical composition comprising a therapeutically 

effective amount of a Nitl protein; and a therapeutically acceptable carrier. 
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13. A method of treating or preventing a disease or disorder in a 
subject comprising administering to said subject a therapeutically effective amount 
of a molecule that inhibits Nitl function. 

5 

14. A method of treating or preventing a disease or disorder in a 
subject comprising administering to said subject a therapeutically effective amount 
of a molecule that enhances Nitl function. 

10 15. A method of diagnosing or screening for the presence of or a 

predisposition for developing a disease or disorder in a subject comprising 
detecting one or more mutations in NITl DNA, RNA or Nitl protein derived from 
the subject in which the presence of said one or more mutations indicates the 
presence of the disease or disorder or a predisposition for developing the disease or 

15 disorder. 

16. A method of treating or preventing a disease or disorder in a 
subject by using a vector containing the NITl gene coding sequence. 

20 



25 



30 
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ABSTRACT 

The present invention relates to nucleotide sequences of the NITl gene and 
5 amino acid sequences of its encoded proteins, as well as derivatives and analogs 
thereof. Additionally, the present invention relates to the use of nucleotide 
sequences of NITl genes and amino acid sequences of their encoded proteins, as 
well as derivatives and analogs thereof and antibodies thereto, as diagnostic and 
therapeutic reagents for the detection and treatment of cancer. The present 
10 invention also relates to therapeutic compositions comprising Nitl proteins, 
derivatives or analogs thereof, antibodies thereto, nucleic acids encoding the Nitl 
proteins, derivatives, or analogs and NITl antisense nucleic acids, and vectors 
containing the NITl coding sequence. 
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OTID 
[Strarid] 

1 GCCCACTCGC TGCGGCCTNT CTGGCTCCAG ACCGCCCTCC GGATCGGACC CTGCGAATGG 
PLA AA? LAPD RPP DRT LRMV 
61 TTTTGGCTAT ATCTTGATOr AGGRCCTACT CCCTATCCCG OCGGCCGCGG CTGGGCTTCA 
LAI SSC RTYS LSR RPR LGFI 
121 TCACCAGGCC TCCTCACAGA TTCCTGTCCC TTCTGTGTCC TGGACTCCGG ATACCTCAAC 
TRP PHR FLSL LCP GLR IPQL 
181 TCTCAGTACT TTGTGCTCAG CCCAGGCCCA GAGCCATGGC TATCTCCTCT TCCTCCTGCG 
SVL CAQ PRPR AMA ISS SSCE 
241 AACTGCCCCT GGTGGCTGTG TGCCAGGTAA CATCGACGCC AGACAAGCAA CAGAACTTTA 
LPL VAV CQVT STP DKQ QNFK 
301 AAACATGTGC raAGCTGGTT CGAGAGGCTG CCAGACTGGG TGCCTGCCTG GCTTTCCTGC 
TCA ELV REAA RLG ACL AFLP 
361 CTGAGGCATT TCACTTCATT GCACGGGACC CTGCAGAGAC GCTACACCTG TCTGAACCAC 
EAF DFI ARDP AET LHL SEPL 
421 TGGGTGGGAA ACTTTTGGAA GAATACACCC AGdTGCCAG GGAATGTGGA CTCTGGCTGT 
GGK LliE EYTQ LAR EGG LWLS 
481 CCTTGGGTCG TTTCCATGAG CGTGGCCAAG ACTGGGAGCA GACTCAGAAA ATCTACAATT 
LGG FHE RGQD WEQ TQK lYNC 
541 GTCACGTGCT GCTCAACAGC AAAGGGGCAG TAGTGGCCAC TTACAGGAAG ACACATCTGT 
HVL LNS KGAV VAT YRK THLC 
601 GTCACGTAGA GATTCCAGGG CAGGGGCCTA TGTGTGAAAG CAACTCTACC ATGCCTGGGC 
DVE IPG QGPM CES NST MPGP 
661 CCAGTCTTCA GTCACCTGTC AGCACACCAG CAGGCAAGAT TGGTCTAGCT GTCTGCTATG 
SLE SPV STPA GKI GLA VCYD 
721 ACATGCGGTT CCCTGAACTC TCTCTGGCAT TGGCTCAAGC TGGAGCAGAG ATACTTACCT 
MRF PEL SLAL AQA GAE ILTY 
781 ATCCTTCAGC TTTTGGATCC ATTACAGGCC CAGCCCACTG GGAGGTGTTG CTGCGGGCCC 
PSA FGS I/TGP AHW EVL LRAR 
841 GTGCTATCGA AACCCAGTGC TATGTAGTGG CAGCAGCACA GTGTGGACGC CACCATGAGA 
AIE TQC YVVA AAQ CGR HHEK 
901 AGAGAGCAAG 1TATGGCCAC AGCATGGTGG TAGACCCCTG GGGAACAGTG GTGGCCCGCT 
RAS YGH SMVV DPW GTV VARC 
961 GCTCTCAGGG GCCAGGCCTC TGCCTTGCCC GAATAGACCT CAACTATCTG CGACAGITGC 
SEG PGL CLAR IDL NYL RQLR 
1021 GCCGACACCT GCCTGTGTrC CAGCACCGCA GGCCTGACCT CTATGGCAAT CTGGGTCACC 
RHL PVF QHRR PDL YGN LGHP 
1081 CACTGTCITA AGACTTGACT TCTGTGAGTT TAGACCTGCC CCTCCCACCC CCACCCTGCC 
LS. DLT SVSL DLP LPP PPCH 
1141 ACTATGAGCT AGTGCTCATG TGACTTGGAG GCAGGATCCA GCCACAGCTC CCCTCACTTG 
YEL VLM .LGG RIQ AQL PSLG 
1201 GAGAACCTTG ACTCTCTTGA TGGAACACAG ATGGGCTGCT TGGGAAAGAA ACTTTCACCT 
EP. LS. WNTD GLL GKE T'FT. 
1261 GAGCITCACC TGAGGTCAGA CTGCAGTTTC AGAAAGGTGG AATTTTATAT AGTCATTGTT 
ASP EVR LQFQ KGG ILY SHCL 
1321 TATTTCATGG AAACTGAAGT TCTGCTGAGG GCTGAGCAGC ACTGGCATTG AAAAATATAA 
FHG N.S SAEG .AA LAL KNII 
1381 TAATCATAAA GTCAAAAAAA AAAAAAAAAA AAAAAA 
IIK SKK KKKK K 



Fig. 6 



Please type a plus sign (+) inside this box — > [+] ^ PTO/SB/01 (12-97) 

Approved for use tlirougli 9/30/00. 0MB 0651-0032 
^ , . . Patent and Trademarl< Office; U.S. DEPARTIVIENT OF COIVIMERCE 

under the Papenwork Reduction Act of 1995, no persons are required to respond to a collection of information unless it contains 
a valid OMB control number. 



+ 



DECLARATION FOR UTILITY OR 
DESIGN 
PATENT APPLICATION 
(37 CFR1.63) 



Q Declaration 
Submitted 
with Initial 
Filing 



□ Declaration 
OR Submitted after Initial 
Filing (surcharge 
(37 CFR 1.16(e)) 
required) 



Attorney Docket Number 



First Named Inventor 



CROOl.NPOOl 



CROCE, Carlo M. 



COMPLETE IF KNOWN 



Application Number 



Filing Date 



Group Art Unit 



Examiner Name 



/ 



July 20, 1999 



As a below named inventor, I hereby declare that: 

My residence, post office address, and citizenship are as stated below next to my name. 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and joint inventor (if plural 
names are listed below) of the subject matter which is claimed and for which a patent is sought on the invention entitled: 

NTTRILASE HOMOLOGS 



the specification of which 
m 

^ is attached hereto 
OR 

□ was filed on (MM/DDATYY) 



(Title of the Invention) 



as United States Application Number or PCT International 



Application Number 



and was amended on (MM/DDA'YYY) 



J (if applicable). 



I hereby state that I have reviewed and understand the contents of the above Identified specification, including the claims, as 
amended by any amendment specifically referred to above. 

1 acknowledge the duty to disclose information which is material to patentability as defined in 37 CFR 1.56. 



hereby claim foreign priority benefits under 35 U.S.C. 119(a)-(d) or 365(b) of any foreign applicatlon(s) for patent or inventor's 
certificate, or 365(a) of any PCT international application which designated at least one country other than the United States of 
America, listed below and have also identified below, by checking the box, any foreign application for patent or inventor's certificate 
or of any PCT international application having a filing date before that of the application on which priority is claimed. 



Prior Foreign Application 


Country 


Foreign Filing Date 


Priority 


Certified Copy Attached? 


Number(s) 


(MM/DD/YYYY) 


Not Claimed 


YES 


NO 








□ 


□ 


□ 








□ 


□ 


□ 








□ 


□ 


□ 








□ 


n 


□ 



Zl Additional foreign application numbers are listed on a supplemental priority data sheet PTO/SB/02B attached hereto: 



I hereby claim the benefit under 35 U.S.C. 119fe) of any United States provisional application(s) listed below. 



Application Number(s) 



60/093,350 



Filing Date (MIVi/DD/YYYY) 



07/20/1998 



I I Additional provisional application 
numbers are listed on a 
supplemental priority data sheet 
PTO/SB/02B attached hereto. 



+ 



[Page 1 of 2] 

Burden Hour Statement: This form is estimated to take 0.4 hours to complete. Time will vary depending upon the needs of the 
individual case. Any comments on the amount of time you are required to complete this form should be sent to the Chief Information 
Officer, Patent and Trademarl< Office, Washington, DC 20231. DO NOT SEND FEES OR COMPLETED FORMS TO THIS 
ADDRESS. SEND TO: Assistant Commissioner for Patents, Washington, DC 20231, 



Please type a plus sign (+) inside lliis box 



I 1 PTO/SB/01 (12-97) 

I + I Approved for use through 9/30/00. 0MB 0651-0032 

Patent and Trademark Office; U.S. DEPARTMENT OF COMMERCE 
Under the Papenwork Reduction Act of 1995. no persons are required to respond to a coilection of information unless it contains 
a valid 0MB control number. 



+ 



DECLARATION — Utility or Design Patent Application 



I hereby claim the benefit under 35 U.S.C. 120 of any United States app!ication(s), or 365(c) of any PCT intemational application designating the 
United States of America, listed below and, insofar as the subject matter of each of the claims of this application is not disclosed in the prior 
United States or PCT Intemational application in the manner provided by the first paragraph of 35 U.S.C. 1 12, 1 acknowledge the duty to disclose 
information which is matenal to patentability as defined in 37 CFR 1 .56 which became available between the filing date of the prior application 
and the national or PCT intemational filing date of this application. 



U.S. Parent Application or PCT Parent 
Number 



Parent Filing Date 
(MM/DD/YYYY) 



Parent Patent Number 
(if applicable) 



□ Additional U.S. or PCT international application numbers are listed on a supplemental priority data sheet PTO/SB/02B attached hereto. 



As a named Inventor, I hereby appoint the following registered practi tioner(s) to prosecute this app lication and to tra nsact all business in the Paten t 
and Trademark Office connected therewith: Q Customer Number 

OR 



El Registered practitioner(s) name/registration number listed below 



Place Customer 
Number Bar Code 
I 3hff/ hftm 



Name 



Registration 
Number 



Name 



Registration 
Number 



Clifford Kent Weber, Esq. 



42,215 



□ Additional registered practitioner(s) named on supplemental Registered Practitioner Information sheet PTO/SB/02C attached hereto. 



Direct all correspondence to: □ Customer Number 

or Bar Code Label 



OR EI Correspondence address below 



Name 



Clifford Kent Weber, Esq. 



Address 



Thomas Jefferson University - Office of University Counsel 



Address 



1020 Walnut Street - Suite 620 



City 



Philadelphia 



State 



PA 



ZIP 



19107-5587 



Country 



USA 



Telephone 



(215) 503-0757 



Fax 



(215) 923-3613 



I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and belief are 
believed to be true; and further that these statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under 18 U.S.C. 1001 and that such willful false statements may jeopardize the validity of the 
application or any patent Issued thereon. 



Name of Sole or First Inventor: 



n A petition has been filed for this unsigned inventor 




D Additional inventors are being named on the supplemental Additional lnventor(s) sheet(s) PTO/SB/02A attached hereto 



[Page 2 of 2] 



